The language of collaborative tagging
Abstract
Collaborative tagging is the process whereby people attach keywords, known as tags, to digital resources, such as text and images, in order to render them retrievable in the future. This thesis investigates how tags submitted by users in collaborative tagging systems function as descriptors of a resource’s perceived content. Using computational and theoretical tools, I compare collaborative tagging with natural language description in order to determine whether, and to what extent, the former behaves like the latter. I start the investigation by collecting a corpus of tagged images and exploring the relationship between a resource and a tag using theories from different disciplines, such as Library Science, Semiotics and Information Retrieval. Then, I study the lexical characteristics of individual tags, suggesting that tags behave as natural language words. The next step is to examine how tags combine when annotating a resource. I show that similar combinatory constraints hold for tags assigned to a resource and for words as used in coherent text. This finding raises the question of whether the similar combinatory patterns between tags and words are due to implicit semantic relations between the tags. To provide an answer, I conduct an experiment asking humans to submit both tags and textual descriptions for a set of images, constructing a parallel corpus of more than one thousand tag-text annotations. Analysis of this parallel corpus provides evidence that a large number of tag pairs are connected via implicit semantic relations, whose nature is described. Finally, I investigate whether it is possible to automatically identify the semantically related tag pairs and make their relationship explicit, even in the absence of supporting image-specific text. I construct and evaluate a proof-of-concept system to demonstrate that this task is attainable.
Similar resources
A part-of-speech tagging system for the Persian language
Abstract: Part-Of-Speech (POS) tagging is an essential step for many models and methods in other areas of natural language processing, such as machine translation, spell checking, text-to-speech, automatic speech recognition, etc. So far, highly accurate POS taggers have been created for many languages. In this paper, we focus on POS tagging in the Persian language. Because of problems in Persian POS t...
An improved joint model: POS tagging and dependency parsing
Dependency parsing is a form of syntactic parsing in natural language processing that automatically analyzes the dependency structure of sentences, producing a dependency graph for each input sentence. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
Impact of Grouping Type in Descriptive Collaborative Writings on Iranian EFL Learners' Written Grammatical Accuracy
The current study was an attempt to investigate the impact of grouping type on the grammatical accuracy of Iranian EFL learners in collaborative writing. After the Michigan Test of English Language Proficiency was administered, 64 available female university students participated in this study and were assigned to two groups--heterogeneous and homogeneous. The treatment process lasted 12 weeks o...
Task Complexity Manipulation and EFL learners’ interactions in the process of collaborative pre-planning
Today, learners’ interaction and collaborative task performance have attracted increasing attention from language teachers and researchers. The present study investigated whether collaborative pre-planning, task complexity manipulation, and language proficiency level play a role in learners’ interactions. To this end, 128 EFL learners from two different language proficiency levels carried out t...
Individual and Collaborative Planning Conditions: Effects on Fluency, Complexity and Accuracy in L2 Argumentative Writing
Research on pre-task planning has revealed that planned conditions have produced more fluent and complex language than unplanned conditions. To date, most of these studies have investigated the effects of individual planning on language production while collaborative planning has received scant attention. To determine the effects of pre-task planning on second language written production, the p...
Collaborative vs. Individual Task Planning and Iranian EFL Learners’ Writing Performance
Abstract: This study was intended to compare collaborative and individual task planning effects on Iranian EFL learners’ writing performance. Therefore, a group of 90 upper intermediate EFL learners were placed in three groups of Collaborative Task planning, Individual Task planning, and Control. In the experimental groups, participants were asked to do task planning collaboratively and individu...